feat: Add dedicated mode for UnslothService by vivekkalyan · Pull Request #577 · OpenPipe/ART

vivekkalyan · 2026-02-23T23:50:07Z

Stacked on #578 (edit: now on main)

Summary

Adds a dedicated mode to UnslothService where training and inference run on separate GPUs. Training stays in-process on the trainer GPU(s); vLLM runs as a subprocess on the inference GPU. This eliminates the sleep/wake overhead of shared mode and enables true async overlap between training and inference.

Shared mode remains the default and is unchanged. Dedicated mode is opt-in via explicit GPU ID lists in _internal_config:

_internal_config = {
    "trainer_gpu_ids": [0],
    "inference_gpu_ids": [1],
}

Architecture

Shared Mode (unchanged):
  Single GPU timeshares training ↔ vLLM via sleep/wake

Dedicated Mode (new):
  Main process (GPU 0): Training (Unsloth + PEFT)
  Subprocess  (GPU 1): vLLM server (OpenAI-compatible API)
  Communication: HTTP only
  Adapter sync: disk checkpoint + /v1/load_lora_adapter (load_inplace=true)

What's included

Config contracts (dev/model.py, dev/validate.py): trainer_gpu_ids and inference_gpu_ids with validation (non-overlapping, contiguous from 0, single inference GPU for now)
vLLM subprocess entry point (vllm/dedicated_server.py): Applies ART patches, enables tool calling and runtime LoRA updates, then starts vLLM
Dedicated mode in UnslothService (unsloth/service.py): Subprocess lifecycle (start, health check, close), adapter reload via HTTP after each train step, no sleep/wake
LocalBackend routing (local/backend.py): Detects dedicated config, sets CUDA_VISIBLE_DEVICES before model init, routes to dedicated mode
Unit tests: Config validation + dedicated server arg building

Testing

Benchmark: ART-E to convergence with both Shared and Dedicated mode:

…ated mode

…nfig

bradhilton

LGTM

vivekkalyan force-pushed the feat/dedicated-unsloth branch 2 times, most recently from 51bda4d to 7c12d57 Compare February 24, 2026 00:30

vivekkalyan changed the base branch from main to fix/ci-failures February 24, 2026 00:30

vivekkalyan changed the title ~~feat: Add dedicated unsloth~~ feat: Add dedicated mode for UnslothService Feb 24, 2026

vivekkalyan force-pushed the feat/dedicated-unsloth branch 2 times, most recently from a92f9f3 to a5cfa22 Compare February 24, 2026 01:01

vivekkalyan changed the base branch from fix/ci-failures to main February 24, 2026 01:09

vivekkalyan added 16 commits February 23, 2026 17:09

feat: Add dedicated mode config contracts

5efa6cd

test: Add unit tests for dedicated mode config validation

9db349e

feat: Add dedicated vLLM subprocess entry point

e85aa1b

feat: Add dedicated mode to UnslothService

452117c

feat: Add dedicated mode routing in LocalBackend

89152fc

fix: Handle register_lora_for_step in dedicated mode

0c59167

fix: enable tool calling in dedicated vLLM subprocess

11e5a33

fix: prevent race between checkpoint save and adapter reload in dedic…

e39c41c

…ated mode

refactor: use config builder for dedicated vLLM server args passthrough

a20d269

fix: guard train_sft against use in dedicated mode

164c7b1

fix: remove redundant log file in dedicated vLLM subprocess

d5568bb

fix: use line-buffered log file for dedicated vLLM subprocess

7203d78

fix: reject fast_inference and enable_sleep_mode in dedicated mode co…

ee666ac

…nfig

refactor: inline dedicated server config into _start_vllm_subprocess

74c9265

style: Format with ruff

4c18657

fix: Resolve ty type error

5930142

vivekkalyan force-pushed the feat/dedicated-unsloth branch from a5cfa22 to 5930142 Compare February 24, 2026 01:09

vivekkalyan requested a review from bradhilton February 24, 2026 01:19

bradhilton approved these changes Feb 24, 2026

View reviewed changes

vivekkalyan merged commit 3990b66 into main Feb 24, 2026
2 checks passed

vivekkalyan deleted the feat/dedicated-unsloth branch February 24, 2026 23:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add dedicated mode for UnslothService#577

feat: Add dedicated mode for UnslothService#577
vivekkalyan merged 16 commits intomainfrom
feat/dedicated-unsloth

vivekkalyan commented Feb 23, 2026 •

edited

Loading

Uh oh!

bradhilton left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

vivekkalyan commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Architecture

What's included

Testing

Uh oh!

bradhilton left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vivekkalyan commented Feb 23, 2026 •

edited

Loading